The linear transformation of LF glottal waveforms for voice conversion
Abstract
Most voice conversion (VC) systems use source-filter decomposition based on linear prediction (LP) to transform spectral envelopes, and as a result suffer from various issues caused by the oversimplified LP voice source model. Whilst residual prediction methods can mitigate this problem, they cannot be used to modify voice source quality. This paper presents a system that employs linear transformations to convert both the spectral envelope and the LF glottal waveform. Its performance is shown to be comparable to that of a state-of-the-art VC implementation in terms of speaker identity conversion, while its output has better quality. It is also capable of transforming the quality of the voice source.
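The abstract does not specify the form of the linear transformations, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes time-aligned source and target feature frames (for example spectral-envelope or LF parameters) and fits a simplified per-dimension affine map by least squares. The names `fit_affine` and `convert`, and the toy data, are hypothetical.

```python
# Illustrative sketch only: a per-dimension affine map y ~ a*x + b fitted by
# least squares on time-aligned source/target feature frames. This is an
# assumed simplification, not the transformation used in the paper.

def fit_affine(source_frames, target_frames):
    """Return per-dimension (slopes, offsets) fitted by least squares."""
    n = len(source_frames)
    dims = len(source_frames[0])
    slopes, offsets = [], []
    for d in range(dims):
        xs = [frame[d] for frame in source_frames]
        ys = [frame[d] for frame in target_frames]
        mean_x = sum(xs) / n
        mean_y = sum(ys) / n
        var_x = sum((x - mean_x) ** 2 for x in xs)
        cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        # Fall back to a unit slope if the source dimension is constant.
        a = cov_xy / var_x if var_x > 0 else 1.0
        slopes.append(a)
        offsets.append(mean_y - a * mean_x)
    return slopes, offsets


def convert(frame, slopes, offsets):
    """Apply the fitted per-dimension affine map to one feature frame."""
    return [a * x + b for x, a, b in zip(frame, slopes, offsets)]


# Toy, made-up two-dimensional feature frames (hypothetical values).
source = [[0.0, 1.0], [1.0, 2.0], [2.0, 4.0]]
target = [[1.0, 0.5], [3.0, 1.0], [5.0, 2.0]]

slopes, offsets = fit_affine(source, target)
converted = convert([3.0, 3.0], slopes, offsets)
```

A full system would more plausibly use a matrix-valued transform, possibly one per acoustic class, rather than independent per-dimension slopes. The diagonal form is used here only to keep the sketch short and dependency-free.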
Related papers
Adding Glottal Source Information to Intra-Lingual Voice Conversion
This paper studies the inclusion of glottal source characteristics in voice conversion (VC) systems. We use source/filter decomposition to parametrize the vocal tract using LSF, the glottal source using the LF model, and the aspiration noise using amplitude-modulated, high-pass filtered AWGN. To evaluate the impact of this new parametrization in VC, we use a reference conversion system tha...
Automatic estimation of voice source parameters
Voice source parameters can be estimated by fitting a voice source model to the glottal flow signal which is obtained by means of inverse filtering. In this paper we investigate the behaviour of the LF-model in a number of non-linear parameter estimation procedures. It is concluded that (1) the parameter estimates are robust against additive (white and narrow band) noise in the flow waveforms...
A Review of Glottal Waveform Analysis
Glottal inverse filtering is of potential use in a wide range of speech processing applications. Since voice production is, to a first-order approximation, a source-filter process, obtaining the source and filter components provides a flexible representation of the speech signal for use in processing applications. In certain applications the desire for accurate inverse filterin...
Voice conversion based on parameter transformation
This paper describes a voice conversion system based on parameter transformation [1]. Voice conversion is the process of making one person's voice (the "source") sound like another person's voice (the "target") [2]. We present a voice conversion scheme consisting of three stages. First, an analysis is performed on the natural speech to obtain the acoustical parameters. These parameters will be voiced an...
Towards an improved modeling of the glottal source in statistical parametric speech synthesis
This paper proposes the use of the Liljencrants-Fant model (LF-model) to represent the glottal source signal in HMM-based speech synthesis systems. These systems generally use a pulse train to model the periodicity of the excitation signal of voiced speech. However, this model produces a strong and uniform harmonic structure throughout the spectrum of the excitation which makes the synthetic spe...